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Quantum theory is usually formulated in terms of abstract mathematical postulates, involving 
Hilbert spaces, state vectors, and unitary operators. In this work, we show that the full formalism of 
quantum theory can instead be derived from five simple physical requirements, based on elementary 
assumptions about preparation, transformations and measurements. This is more similar to the 
usual formulation of special relativity, where two simple physical requirements - the principles of 
relativity and light speed invariance - are used to derive the mathematical structure of Minkowski 
space-time. Our derivation provides insights into the physical origin of the structure of quantum 
state spaces (including a group-theoretic explanation of the Bloch ball and its three-dimensionality), 
and it suggests several natural possibilities to construct consistent modifications of quantum theory. 



I. INTRODUCTION 

Quantum theory is usually formulated by postulating 
the mathematical structure and representation of states, 
transformations, and measurements. The general phys- 
ical consequences that follow (like violation of Bell-type 
inequalities [1], the possibility of performing state tomog- 
raphy with local measurements, or factorization of inte- 
gers in polynomial time [2] ) come as theorems which use 
the postulates as premises. In this work, this procedure 
is reversed: we impose five simple physical requirements, 
and this suffices to single out quantum theory and derive 
its mathematical formalism uniquely This is more sim- 
ilar to the usual formulation of special relativity, where 
two simple physical requirements — the principles of rela- 
tivity and light speed invariance — are used to derive the 
mathematical structure of Minkowski space-time and its 
transformations. 

The requirements can be schematically stated as: 

1. In systems that carry one bit of information, each 
state is characterized by a finite set of outcome 
probabilities. 

2. The state of a composite system is characterized 
by the statistics of measurements on the individual 
components. 

3. All systems that effectively carry the same amount 
of information have equivalent state spaces. 

4. Any pure state of a system can be reversibly trans- 
formed into any other. 

5. In systems that carry one bit of information, all 
mathematically well-defined measurements are al- 
lowed by the theory 

These requirements are imposed on the framework of 
generalized probabilistic theories [3-9] , which already as- 



sumes that some operational notions (preparation, mix- 
ture, measurement, and counting relative frequencies of 
measurement outcomes) make sense. Due to its concep- 
tual simplicity, this framework leaves room for an infini- 
tude of possible theories, allowing for weaker- or stronger- 
than-quantum non-locality [6, 10-14]. In this work, we 
show that quantum theory (QT) and classical probabil- 
ity theory (CPT) are very special among those theories: 
they are the only general probabilistic theories that sat- 
isfy the five requirements stated above. 

The non-uniqueness of the solution is not a problem, 
since CPT is embedded in QT, thus QT is the most gen- 
eral theory satisfying the requirements. One can also 
proceed as Hardy in [4] : if Requirement 4 is strengthened 
by imposing continuity of the reversible transformations, 
then CPT is ruled out and QT is the only theory satisfy- 
ing the requirements. This strengthening can be justified 
by the continuity of time evolution of physical systems. 

It is conceivable that in the future, another theory may 
replace or generalize QT. Such a theory must violate at 
least one of our assumptions. The clear meaning of our 
requirements allows to straightforwardly explore poten- 
tial features of such a theory. The relaxation of each of 
our requirements constitutes a different way to go beyond 
QT. 

The search for alternative axiomatizations of quantum 
theory (QT) is an old topic that goes back to Birkhoff 
and von Neumann [8] , and has been approached in many 
different ways: extending propositional logic [7, 8], using 
operational primitives [3-6, 9], searching for information- 
theoretic principles [5, 6, 10, 11, 19-21], building upon 
the phenomenon of quantum nonlocality [6, 10-13]. Alf- 
sen and Shultz [22] have accomplished a complete char- 
acterization of the state spaces of QT from a geometric 
point of view, but the result does not seem to have an 
immediate physical meaning. In particular, the fact that 
the state space of a generalized bit is a three-dimensional 
ball is an assumption there, while here it is derived from 



physical requirements. 

This work is particularly close to [4, 19], from where it 
takes some material. More concretely, the multiplicativ- 
ity of capacities and the Simplicity Axiom from [4] are 
replaced by Requirement 5. In comparison with [19], the 
fact that each state of a generalized bit is the mixture 
of two distinguishable ones, the maximality of the group 
of reversible transformations and its orthogonality, and 
the multiplicativity of capacities, are also replaced by 
Requirement 5. 

Summary of the paper. Section II contains an introduc- 
tion to the framework of generalized probabilistic theo- 
ries, where some elementary results are stated without 
proof. In Section III the five requirements and their sig- 
nificance are explained in full detail. Section IV is the 
core of this work. It contains the characterization of all 
theories compatible with the requirements, concluding 
that the only possibilities are CPT and QT. The Conclu- 
sion (Section V) recapitulates the results and adds some 
remarks. The Appendix contains all lemmas and their 
proofs. 



II. GENERALIZED PROBABILISTIC 
THEORIES 

In CPT there can always be a joint probability distri- 
bution for all random variables under consideration. The 
framework of generalized probabilistic theories (GPTs), 
also called convex operational framework, generalizes this 
by allowing the possibility of random variables that can- 
not have a joint probability distribution, or cannot be si- 
multaneously measured (like noncommuting observables 
in QT). 

This framework assumes that at some level there is a 
classical reality, where it makes sense to talk about exper- 
imentalists performing basic operations such as: prepa- 
rations, mixtures, measurements, and counting relative 
frequencies of outcomes. These are the primary concepts 
of this framework. It also provides a unified way for all 
GPTs to represent states, transformations and measure- 
ments. A particular GPT specifies which of these are 
allowed, but it does not tell their correspondence with 
actual experimental setups. On its own, a GPT can still 
make nontrivial predictions like: the maximal violation 
of a Bell inequality [1], the complexity-theoretic compu- 
tational power [2, 18], and in general, all information- 
theoretic properties of the theory [6] . 

The framework of GPTs can be stated in different 
ways, but all lead to the same formalism [3-9]. This for- 
malism is presented in this section at a very basic level, 
providing some elementary results without proofs. 



A. States 

Definition of system. To a setup like FIG. 1 we asso- 
ciate a system if for each configuration of the prcpara- 
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FIG. 1: General experimental set up. From left to right 
there are the preparation, transformation and measurement 
devices. As soon as the release button is pressed, the prepa- 
ration device outputs a physical system in the state specified 
by the knobs. The next device performs the transformation 
specified by its knobs (which in particular can be "do noth- 
ing"). The device on the right performs the measurement 
specified by its knobs, and the outcome (x or x) is indicated 
by the corresponding light. 



tion, transformation and measurement devices, the rela- 
tive frequencies of the outcomes tend to a unique proba- 
bility distribution (in the large sample limit). 

The probability of a measurement outcome x is de- 
noted by p{x). This outcome can be associated to a 
binary measurement which tells whether x happens or 
not (this second event x, has probability p(x) = l—p(x)). 
The above definition of system allows to associate to each 
preparation procedure a list of probabilities for the out- 
comes of all measurements that can be performed on a 
system. As we show in Subsection IV C below, our re- 
quirements imply that all these probabilities p{x) are de- 
termined by a finite set of them; the smallest such set is 
used to represent the state 
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The measurement outcomes that characterize the state 
Xi, . . . , Xd are called fiducial, and in general, there is more 
than one set of them (for example, a —spin particle in QT 
is characterized by the spin in any 3 linearly-independent 
directions). Note that each of the fiducial outcomes can 
correspond to a different measurement. The redundant 
component ip = 1 is reminiscent of QT, where one of the 
diagonal entries of a density matrix is redundant, since 
they sum up to 1. In fact ip° ^ 1 is sometimes used 
to represent unnormalized states, but not in this paper, 
where only normalized states arc considered. The re- 
dundant component tp° allows to use the tensor-product 
formalism in composite systems (Subsection II D), which 
simplifies the notation. 

The set of all allowed states S is convex [23], because 
if ipi , ip2 65 then one can prepare ipi with probability q 
and ip2 with probability 1 — q, effectively preparing the 
state qipi + (1 — q)ijj2- The number of fiducial probabilities 
d is equal to the (affine) dimension of S, otherwise one 
fiducial probability would be functionally related to the 
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others, and hence redundant. 

Suppose there is a R rf+1 -vector tp S which is in the 
topological closure of S - that is, ip can be approximated 
by states ij)' € S to arbitrary accuracy. Since there is 
no observable physical difference between perfect prepa- 
ration and arbitrarily good preparation, we will consider %j) 
to be a valid state and add it to the state space. This does 
not change the physical predictions of the theory, but it 
has the mathematical consequence that state spaces be- 
come topologically closed. Since state vectors (f) arc 
bounded, and we are in finite dimensions (shown in Sub- 
section IV C), state spaces S are compact convex sets [23] . 

The pure states of a state space S are the ones that 
cannot be written as mixtures: ip 7^ Qipi + (1 — 9)^2 with 
7^ and < q < 1. Since S is compact and convex, 
all states are mixtures of pure states [23]. 

B. Measurements 

The probability of measurement outcome x when 
the system is in state ip € S is given by a function 
Q. x {ip). Suppose the system is prepared in the mixture 
qijji + (I — q)i/>2, then the relative frequency of outcome 
x does not depend on whether the label of the actual 
preparation ipk is ignored before or after the measure- 
ment, hence 

+ (1 - qH'2) = gfixWn) + (!- ?)^(V>2) ■ 

This means that the function fl x is affine on S. The 
redundant component ip° in (1) allows to write this func- 
tion as a linear map ft x : R d+1 — > R [3, 6]. 

An effect is a linear map f2 : K d+1 — > M such that 
^(■0) G [0, 1] for all states ip € S. Every function Sl x as- 
sociated to an outcome probability p(x) is an effect. The 
converse is not necessarily true: the framework of GPTs 
allows to construct theories where some effects do not 
represent possible measurement outcomes. These restric- 
tions are analogous to superselection rules, where some 
(mathematically well-defined) states are not allowed by 
the physical theory This is related to Requirement 5. 
A tight effect Q is one for which there are two states 
V'cV'i 65 satisfying ri(^o) = and fi(V>i) = I. 

An n-outcome measurement is specified by n effects 
fii, . . . , fi n such that Oi(^>) + • • • + 0, n (ip) = 1 for all 
tj) G S. The number f2 a (V0 is the probability of outcome 
a when the measurement is performed on the state ip. 
The states ipx, . . . , tp n are distinguishable if there is an 
n-outcome measurement such that f2 Q (?/>6) = Sa,b, where 
$a,b = 1 if a = b, and S a ,b = if a 7^ b. 

The capacity of a state space S is the size of the largest 
family of distinguishable states, and is denoted by c. This 
is the amount of classical information that can be trans- 
mitted by the corresponding type of system, in a single- 
shot error-free procedure. (In QT the capacity of a sys- 
tem is the dimension of its corresponding Hilbert space; 
which must not be confused with the dimension of the 
state space d = c 2 — I, that is, the set of c x c complex 



matrices that are positive and have unit trace.) A com- 
plete measurement on S is one capable of distinguishing 
c states. 

C. Transformations 

Each type of system has associated to it: a state space, 
a set of measurements, and a set of transformations. A 
transformation T is a map T : S — > S. Similarly as 
for measurements, if a state is prepared as a mixture 
qijji + (I — q)ijj2, it docs not matter whether the label of 
the actual preparation ipk is ignored before or after the 
transformation. Hence 

T (# x + (I - q)i> 2 ) = qT^x) + (I - q)T(i> 2 ) , 

which implies that T is an affine map. The redundant 
component ip° in (1) allows to extend T to a linear map 
T : R d+1 R d+1 [3, 6]. 

A transformation T is reversible if its inverse T -1 ex- 
ists and belongs to the set of transformations allowed 
by the theory. The set of (allowed) reversible transfor- 
mations of a particular state space iS forms a group Q. 
For the same reason as for the state space itself, we will 
assume that the group of reversible transformations is 
topologically closed. Previously we have seen that a state 
space S is bounded, hence the corresponding group of 
transformations Q is bounded, too. In summary, groups 
of transformations are compact [24] . 

D. Composite systems 

Definition of composite system. Two systems A, B con- 
stitute a composite system, denoted AB, if a measure- 
ment for A together with a measurement for B uniquely 
specifies a measurement for AB. This means that if x and 
y are measurement outcomes on A and B respectively, 
the pair (x, y) specifics a unique measurement outcome 
on AB, whose probability distribution p(x, y) does not 
depend on the temporal order in which the subsystems 
are measured. 

The fact that subsystems are themselves systems im- 
plies that each has a well-defined reduced state ipAj^B 
which does not depend on which transformations and 
measurements are performed on the other subsystem (see 
definition of system in Subsection II A) . This is often re- 
ferred to as no-signaling. Let x±, . . . , Xd A be the fiducial 
measurements of system A, and y%, . . . ,yd B the ones of 
B. The no-signaling constraints are 

p(xi) = p{xi,yj)+p(xi,yj) _ 
PiVi) = P{xi,yj)+p{xi,yj) 

for all i,j. 

An assumption which is often postulated additionally 
in the GPT context is Requirement 2, which says that the 
state of a composite system is completely characterized 
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by the statistics of measurements on the subsystems, that 
is, p(x,y). This and no-signaling (2) imply that states 
in AB can be represented on the tensor product vector 
space [3] as 
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The joint probability of two arbitrary local measurement 
outcomes x, y is given by 



p(x, y) = (n x ® n y )(tjj AB ) 



(4) 



where fl x is the effect representing x in A, that is 
p(x) = Q x (iPa), and analogously for Cl y [3]. (The term 
"local" is used when referring to subsystems, and has 
nothing to do with spatial locations.) In other words, 
if {fi^, . . . , O^} i s an "--outcome measurement on A 1 
and {Of , . . . , 0^} i s an ^--outcome measurement on 
B, then ® Of | a = 1, . . . , n; b = 1, . . . , m} defines a 
measurement on AB with nm outcomes. Local transfor- 
mations act on the global state as 



AB 



(T a ®T b )(iPab) , 



(5) 



where Ta is the matrix that represents the transforma- 
tion in A, and analogously for Tb [3]. The reduced states 
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are obtained from ipAB by picking the right components 
(3). Alternatively, reduced states can be defined by 
Q A (ip A ) = (Oa <g> 1)(iPab) for any effect Qa in A, where 
l{ipB ) = V'b is the unit effect. The reduced state ipA 
must belong to the state space of subsystem A, denoted 
S A , and any state in 5,4 must be the reduction of a state 
from S A b- (Analogously for subsystem B.) This implies 
that all product states 
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it A ® Ipl 



(7) 



are contained in Sab [3], and similarly, all tensor prod- 
ucts of local measurements and transformations are al- 
lowed on AB. 

Given two fixed state spaces Sa and Sb, the previous 
discussion imposes constraints on the state space of the 
composite system Sab- However, there are still many dif- 
ferent possible joint state spaces Sab, and some of them 



allow for larger violations of Bell inequalities than QT. In 
fact, this has been extensively studied [5, 6, 10-14], and 
is one of the reasons for the popularity of generalized 
probabilistic theories. 

Nothing prevents Bob's system from being composite 
itself; hence one can recursively extend the definition of 
composite system and formulas (3), (4), (5), and (7) to 
more parties. 



E. Equivalent state spaces 

Let C : S — > S' be an invertible affine map. If all 
states are transformed as ijj — > £{ip), and all effects on 
S are transformed as O — > Oo£ _1 , then the outcome 
probabilities fl(ip) are kept unchanged. Analogously, if 
all transformations on iS are mapped as T — > CoT o C^ 1 
then their action on the states is the same. The new 
state space 5', together with the transformed effects and 
transformations, is then just a different representation of 
S. In this case, we call S and S f equivalent. In the new 
representation, the entries of ip need not be probabilities 
as in (1), but it may have other advantages. In this work, 
several representations are used. 

In the standard formalism of QT, states are repre- 
sented by density matrices, however they can also be 
represented as in (1). 

Changing the set of fiducial measurements is a particu- 
lar type of ^-transformation. For example, if the compo- 
nents of the Bloch vector (of a quantum spin-i particle) 
correspond to spin measurements in non-orthogonal di- 
rections, then the Bloch sphere becomes an ellipsoid. 



F. Instances of generalized probabilistic theories 

QT is an instance of GPT, and can be specified as 
follows. The state space S c with capacity c is equivalent 
to the set of complex c x c-matrices p such that p > 
and trp = 1. This set has dimension d c = c? — 1, and 
its pure states are rank-one. The effects on S c have the 
form O(p) = tr(Mp), where M is a complex exc-matrix 
such that < M < I. The reversible transformations 
act as p — > VpV^ with V € SU(c). The capacity of a 
composite system AB is the product of the capacities for 
the subsystems cab = cacb- 

CPT is another instance of GPT, and can be speci- 
fied as follows. The state space S c with capacity c is 
equivalent to the set of c-outcomc probability distribu- 
tions [p(l), . . . ,p(c)], which has dimension d c = c — 1 
(in geometric terms, each iS c is a simplex). The pure 
states are the deterministic distributions p(a) = <5 a ,b with 
b = l,...,c. The c-outcome measurement with effects 
O a ('0) = p{a) for a = 1, . . . , c, distinguishes the c pure 
states, hence it is complete. Any other measurement is a 
function of this one. The reversible transformations act 
by permuting the entries of the state [p(l), . . . ,p(c)]. The 
capacity of a composite system is also cab — caC-b- Note 
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that CPT can be obtained by restricting the states of QT 
to diagonal matrices. In other words, CPT is embedded 
in QT. 

An instance of GPT that is not observed in nature 
is generalized no-signaling theory [6], colloquially called 
boxworld. By definition, state spaces contain all correla- 
tions (3) satisfying the no-signaling constraints (2). Such 
state spaces have finitely many pure states, and some of 
them violate Bell inequalities stronger than any quantum 
state [12]. The effects in boxworld arc all generated by 
products of local effects. The group of reversible trans- 
formations consists only of rclabellings of local measure- 
ments and their outcomes, permutations of subsystems, 
and combinations thereof [14]. 

III. THE REQUIREMENTS 

This section contains the precise statement of the re- 
quirements, each followed by explanations about its sig- 
nificance. 

Requirement 1 (Finiteness). A state space with capac- 
ity c = 2 has finite dimension d. 

If this did not hold, the characterization of a state of 
a generalized bit would require infinitely many outcome 
probabilities, making state estimation impossible. It is 
shown below that this requirement, together with the 
others, implies that all state spaces with finite capacity 
c have finite dimension. 

Requirement 2 (Local tomography). The state of a 
composite system AB is completely characterized by the 
statistics of measurements on the subsystems A,B. 

In other words, state tomography [3] can be performed 
locally. This is equivalent to the constraint 

(d A B + l) = (d A + l)(d B + l) (8) 

[3, 4]. This requirement can be recursively extended to 
more parties by letting subsystems A, B to be themselves 
composite. 

Requirement 3 (Equivalence of subspaces). Let S c and 
<S c -i be systems with capacities c and c~l, respectively. 
If Cli, . . . , Cl c is a complete measurement on S c , then the 
set of states ip € S c with Q c (ip) = is equivalent to S c _±. 

The notions of complete measurements and equivalent 
state spaces are defined in Subsections II B and HE. In 
particular, equivalence of 6> c _i and 

S' c _ x ~ Q C (V>) =0}C5 C (9) 

implies that all measurements and reversible transforma- 
tions on one of them can be implemented on the other. 

This requirement, first introduced in [4], implies that 
all state spaces with the same capacity are equivalent: if 
<S c _i and iS c _i are state spaces with capacity c — 1, then 



both are equivalent to (9), hence they are equivalent to 
each other. In other words, the only property that char- 
acterizes the type of system is the capacity for carrying 
information. If we start with S c and apply Requirement 3 
recursively, we get a more general formulation: consider 
any subset of outcomes {ai, . . . , a c <} C {1, . . . , c} of the 
complete measurement Qi, . . . , fi c , then the set of states 
tp G S c with 

n 01 (^) + -" + fi 0e ,M = i (io) 

is equivalent to the state space iS c ' with capacity d . 
This provides an onion-like structure for all state spaces 
Si c S 2 c S 3 c • • • 

The particular structure of QT simplifies the task of 
assigning a state space to a physical system or experi- 
mental setup. It is not necessary to consider all possible 
states of the system, but instead, the relevant ones for 
the context being analyzed. For example, an atom is 
sometimes modeled with a state space having two dis- 
tinguishable states (c = 2), even though its constituents 
have many more degrees of freedom. In particular, if we 
know that only two energy levels are populated with non- 
zero probability, we can ignore all others and effectively 
get a genuine quantum 2-level state space. In a theory 
where this is not true, the effective state space might 
depend on how many unpopulated energy levels are ig- 
nored, or on the detailed internal state of the electron, 
for example. In order to avoid pathologies like this, we 
postulate Requirement 3. 

Requirement 4 (Symmetry). For every pair of pure 
states ipi 7 ip2 £ S there is a reversible transformation G 
mapping one onto the other: G{ip\) = 

The set of reversible transformations of a state space 
S c forms a group, denoted Q c . This group endows S c with 
a symmetry, which makes all pure states equivalent. A 
group Q c is said to be continuous if it is topologically con- 
nected: any transformation is the composition of many 
infinitesimal ones [24]. Hardy invokes the continuity of 
time-evolution in physical systems to justify the conti- 
nuity of reversible transformations [3, 4]; in this case, 
state spaces S c must have infinitely- many pure states; 
this rules out CPT and singles out QT. However, all the 
analysis in this work is done without imposing continu- 
ity, since we find it very interesting that the only theory 
with state spaces having finitely-many pure states, and 
satisfying the requirements, is CPT. 

Requirement 5 (All measurements allowed). All effects 
on 1S2 are outcome probabilities of possible measurements. 

It is shown below that, in combination with the other 
requirements, this implies that all effects on all state 
spaces (with arbitrary c) appear as outcome probabilities 
of measurements in the resulting theory. Note that Re- 
quirement 5 has non-trivial consequences in conjunction 
with the other requirements: adding effects as allowed 
measurements to a physical theory extends the applica- 
bility of Requirement 3. 
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For completeness, we would like to mention that Re- 
quirement 5 can be replaced by the following postulate, 
which has first been put forward in an interesting paper 
that appeared after completion of this work [34] . It calls 
a state "completely mixed" if it is in the relative interior 
of state space. See Lemma 9 in the appendix for how the 
proof of our main result has to be modified in this case. 

Requirement 5' [34]. If a state is not completely 
mixed, then there exists at least one state that can be 
perfectly distinguished from it. 

IV. CHARACTERIZATION OF ALL THEORIES 
SATISFYING THE REQUIREMENTS 




FIG. 2: The left figure is a state space whose boundary con- 
sists of facets (like = 1). Each facet contains infinitely many 
states (fi = 1 contains ip\, %p2 and all Vmix = qipi + (1 — q)ip2) ■ 
The right figure is a state space whose boundary has no 
facets. Any state space has supporting hyperplanes contain- 
ing a unique state (like Q, on c — 1 in both figures). 



A. The maximally-mixed state 

We use the following notation: the system with ca- 
pacity c has state space S c with dimension d c and group 
of reversible transformations Q c . The group Q c is com- 
pact (Section II C), and hence, has a normalized invariant 
Haar measure [26]. This allows to define the maximally- 
mixed state 



fir 



G{^)dG eS c 



(11) 



where ip £ S c is an arbitrary pure state. It follows from 
Requirement 4 that the resulting state fi c does not de- 
pend on the choice of the pure state ip. By construction, 
the maximally- mixed state is invariant: 



G(nc) = \± c for all G £ Q c 



(12) 



Moreover, Lemma 1 shows that it is the only invariant 
state in S c (this lemma and all others are stated and 
proven in the appendix). 



B. The generalized bit 

A generalized bit is a system with capacity two. For 
any state ip £ S2 in the standard representation (1), its 
Bloch representation is defined by 



P{xi) 



P{Xd 2 ) 



6& C 



(13) 



States in the Bloch representation do not have the re- 
dundant component tp°, so equations (4, 5, 7) become 
less simple. The invertible map C : S2 —> S2 is affinc 
but not linear; hence, effects 51 in the Bloch representa- 
tion (ft = flo C~ r ) arc affinc but not necessarily linear. 
The same applies to transformations (G = C o G o C^ 1 ), 
however, the maximally-mixed state in the Bloch repre- 
sentation is the null vector fa = 0, therefore (12) be- 
comes G(0) = 0, which implies that G acts linearly (as 
a matrix). 



Theorem 1. A state in S2 is pure if and only if it belongs 
to the boundary dS2- 

Proof. In any convex set, pure states belong to the 
boundary [23]. Let us see the converse. 

It is shown in [27] that any compact convex set has 
a supporting hyperplane containing exactly one point of 
the set. Translated to our language: there is a tight effect 
fiono on 52 such that only one state </? nc £ S2 satisfies 
ftone(^one) = L this is illustrated in FIG. 2. Accord- 
ing to Requirement 5, the effect Cl onc corresponds to a 
valid measurement outcome, and so does 1 — ft onc , where 
l(-0) = 1 for all \jj £ 6> 2 - Thus, the two effects tl onc and 
1 — fionc define a complete measurement on S2- Impos- 
ing Requirement 3 on the single outcome Cl ono constrains 
the state space with unit capacity 1S1 to contain only one 
state. 

Suppose there is a point in the boundary (p m i x £ dS2 
which is not pure: ip m i x = 101 + (1 — <z)<^2 with ipi ^ 
if>2 and < q < 1. Every point in the boundary of a 
compact convex set has a supporting hyperplane which 
contains it [23]. In our language: there is a tight effect 
f2 on £2 such that ft(^ m ix) = L The affine function Cl 
is bounded: 0,(tp) < 1 for any if £ S2, which implies 
Cl(<pi) = 0(^2) = 1; this is illustrated in FIG. 2. Like 
f2 onc , the effect Q defines a complete measurement, and 
Requirement 3 can be imposed on the single outcome Cl, 
implying that Si contains more than one state. This is 
in contradiction with the previous paragraph; hence, all 
points in the boundary are pure. □ 

For the case tfe = 1, the state space 1S2 is a segment 
(a 1-dimensional ball), hence the previous and next theo- 
rems are trivial. For d 2 > 1, the previous theorem implies 
that 1S2 contains infinitely-many pure states. The next 
theorem recovers the (quantum-like) Bloch sphere with 
a yet unknown dimension c? 2 - 

Theorem 2. There is a set of fiducial measurements for 
which S2 is a d2~ dimensional unit ball. 

Proof. Lemma 2 shows that there is an invertible real 
matrix S such that for each G £ G2 the matrix SGS' 1 
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is orthogonal. Let us redefine the set S2 by transform- 
ing the states as <p — * tp' — qStp, where the number 
q > is chosen such that all pure states are unit vec- 
tors \<p'\ 2 = Cp n (p' = 1. This is possible because in the 
transformed state space, all pure states are related by 
orthogonal matrices (SGS^ 1 ) which preserve the norm. 
Since Theorem 1 also applies to the redefined set S' 2 , it 
must be a unit ball. In what follows we define a new set of 
fiducial measurements x\ such that the Bloch represen- 
tation (13) associated to the new fiducial probabilities 
p(x' i ) coincides with the redefinition tp' . 

Requirement 5 tells that in S 2 , all tight effects are al- 
lowed measurements. For each unit vector v £ M. d2 the 
function Clp(<p') = (1 + v T tp')/2 is a tight effect on the 
unit ball, and conversely, all tight effects on the unit ball 
are of this form. The new set of fiducial measurements 
x ■ has effects £l x > = 0/> 4 , where 



' 1 " 




' o" 







1 

















(14) 



is a fixed orthonormal basis for R d2 . For any state tp' 
the new fiducial probabilities are p(x' i ) = Q x i(tp f ) = 
(1 + <p H )/2, which implies <p H = 2[p(x[) ~ 1/2]/ This is 
just (13) with the new fiducial measurements (note that 
jx' 2 = and $ = fi x ' 4 (0) = 1/2). □ 

In the rest of the paper, we will use the representation 
derived in Theorem 2 above, where the generalized bit is 
represented by a unit ball. Moreover, we will drop the 
prime in S 2 , x[, tp' used in the proof, and simply write 
<S 2 , Xi, tp. 

As argued above, for each pure state ip £ 5 2 there is a 
binary measurement with associated effect 



SlyM) = (1 + £ T V)/2 , 



(15) 



such that £lip(<p) = 1 and Cl v (—tp) = 0. In summary, 
there is a correspondence between tight effects and pure 
states in <S 2 , and each pure state belongs to a distinguish- 
able pair {tp, —tp}. 



capacity c there is a value of m such that c < c m , hence 
by Requirement 3 we have S c C S 2 x m , which implies that 
S c is finite-dimensional. 

In QT, the maximally- mixed state (11) has two con- 
venient properties. First property: if /ia and [1b are the 
maximally- mixed states of systems A and B, then the 
maximally-mixed state of the composite system AB is 



I^ab = <X> [IB 



(16) 



Second property: in the state space iS c , there are c pure 
distinguishable states ip\ , . . . , ip c £ S c such that 



r — ' 



(17) 



Lemmas 3 and 5 show that these two properties hold for 
every theory satisfying our requirements. The following 
theorem exploits these properties to show that the ca- 
pacity is multiplicative (one of the axioms in [4]). 

Theorem 3. If ca and cb are the capacities of systems 
A and B , then the capacity of the composite system AB 
is 



CAB = CACB ■ 



(18) 



Proof. Equation (17) allows to write the maximally- 
mixed states of systems A and B as 



1 



CA 

VA = — J2<Pa 

CA 



P>B = 



-, CB 

cb f-f 



6=1 



where <p^ 



, ifc A £ Sa are pure and distinguishable, 
and tpf , . . . , £ Sb are pure and distinguishable, too. 
This and equation (16) imply 



\±ab = HA® [i-B 



CACB 



(19) 



a=l 6=1 



All states tpf <8> <p B £ Sab are distinguishable with the 
tensor-product measurement, therefore 



cab > cacb 



(20) 



C. Capacity and dimension 

Requirements 1, 2 and 3 imply that a state space with 
finite capacity c has finite dimension d c , which generalizes 
Requirement 1. To see this, consider a system composed 
of m generalized bits, with state space denoted by S 2 xm . 
Since cfo is finite, equation (8) implies that iS 2 xm has finite 
dimension. Due to the fact that perfectly distinguishable 
states are linearly independent, its capacity, denoted c m , 
must be finite, too. Since systems with the same capacity 
are equivalent, we must have c m 7^ c„ for m 7^ n, and 
the sequence of integers c\ , c 2 , . . . is unbounded. For any 



Let (fii 



,Q. CAB ) be a complete measurement on AB 



which distinguishes the states ipi,..., ipc AB & Sab'-, that 
is fifc(V'fc') = <5fc,fc'- According to Lemma 4 these states 
can be chosen to be pure. Since X)fcii^fc(MAs) = L 
there is at least one value of k, denoted fcp, such that 



^fcJ/Ms) < 1 1 cab ■ 



(21) 



The product of pure states tpf (g> tpf is pure [3] , hence Re- 
quirement 4 tells that there is a reversible transformation 
G £ Qab such that G(tpk ) = Pi ® <Pi ■ The measure- 
ment (f2i o G _1 , . . . , Vt CAB o G^ 1 ) distinguishes the states 
G(ipi ),•■•, G(ip CAB ). Inequality (21), the invariance of 
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fj-AB, expansion (19), the positivity of probabilities, and 
(Q ko oG- 1 )(pf»^) = l, imply 

— > (n ko o G-^iflAB) 
CAB 

= — J2 o G- 1 )^ ® ^ ) > . 
cacb c A c B 

a,o 

This and (20) imply (18). □ 

It is shown in [4] that the two multiplicativity formulas 
(8) and (18) imply the existence of a positive integer r 
such that: for any c the state space S c has dimension 

d c = c r - 1 . (22) 

The integer r is a constant of the theory, with values 
r = 1 for CPT and r = 2 for QT. 

D. Recovering classical probability theory 

Let us consider all theories with d 2 = 1- In this case, 
equation (22) becomes d c = c— 1. In [4], it is shown that 
the only GPT with this relation between capacity and 
dimension is CPT, as described in Subsection II F. We 
reproduce the proof for completeness. 

Theorem 4. The only GPT with d 2 = 1 satisfying Re- 
quirements 1-5 is classical probability theory. 

Proof. Let S c be a state space and . . . ,Q C ) a 
complete measurement which distinguishes the states 
ipi, . . . , ip c S <S C . The vectors tf>i , . . . , tf> c € R c are linearly 
independent; otherwise ip a = J2b^a ^ b ^ h an d 1 = ^a(V'a) 
= J^b^a^b^ai^b) = gives a contradiction. Therefore, 
any state ip € S c C R c can be written in this basis 
ip = J2 a la i>a where q a = fl a (ip) turns out to be the 
probability of outcome a. The numbers (q±, . . . , q c ) con- 
stitute a probability distribution, hence, there is a one-to- 
one correspondence between states in S c and c-outcome 
probability distributions. This kind of set is called a 
d c -simplex. A similar argument shows that the effects 
f2i, . . . , Q c are linearly independent. Hence, any effect Q 
on S c can be written as Q — h a £l a , and the constraint 
< £l(ipa) < 1 implies < h a < 1. In other words, every 
measurement on S c is generated by the complete one. 

Every reversible transformation on iS c is a symmetry of 
the <i c -simplex, that is, a permutation of pure states. Due 
to Requirement 4, there is a reversible transformation on 
the bit 1S2 which exchanges the two pure states. Using 
Requirement 3 inductively: if there is a transformation 
on S c -i which exchanges two pure states and leaves the 
rest invariant, this transposition can be implemented on 
iS c , also leaving all other pure states invariant. Therefore, 
all transpositions can be implemented in S c , and those 
generate the full group of permutations. □ 



E. Reversible transformations for the generalized 

bit 

In the rest of the paper, only theories with c?2 > 1 are 
considered. Theorem 2 shows that £2 is a (^-dimensional 
unit ball. Equation (22) for c = 2 implies that c?2 is odd. 
The pure states in S2 are the unit vectors in M. d2 . A 
reversible transformation G G Qi maps pure states onto 
pure states, hence it preserves the norm and has to be an 
orthogonal matrix G T = G _1 . Therefore O2 is a subgroup 
of the orthogonal group 0(c?2). 

Requirement 4 imposes that for any pair of unit vec- 
tors tp,tp' there is G G G2 such that G((p) = ip' . In 
other words, Q2 is transitive on the sphere [28, 29]. Ac- 
cording to Lemma 6, if Q2 is transitive on the sphere, 
then the largest connected subgroup C2 Q Qi is also 
transitive on the sphere. The matrix group C2 is com- 
pact and connected, hence a Lie group (Theorem 7.31 
in [24]). The classification of all connected compact Lie 
groups that are transitive on the sphere is done in [28, 29] . 
For odd c?2, the only possibility is C2 = SOfa), ex- 
cept for c?2 = 7 where there are additional possibilities: 
C 2 = MG 2 M T C SO(7) for any M € 0(7), where G 2 
is the fundamental representation of the smallest excep- 
tional Lie group [30]. For even d 2 , there are many more 
possibilities [28, 29], but equation (22) implies that d 2 
must be odd. 

The stabilizer of the vector vi defined in (14) is the 
subgroup V.2 = {G e Q 2 ■ G{v\) = i>i}. Each transfor- 
mation H s H. 2 has the form 

H= (0 %) ' 

where H £ 7^2 is the nontrivial part. If C 2 = SO(d 2 ) then 
SO(rf 2 - 1) C n 2 . In the case d 2 = 7, if C 2 = MG 2 M T 
then 7-L 2 contains (up to the similarity M) the real 6- 
dimensional representation of SU(3) given by 

H=( TcU n im !H , (23) 
V —mi U reU J v ' 

where re U and im U are the real and imaginary parts of 
U G SU(3) (sec exercise 22.27 in [30]). 



F. Two generalized bits 

The joint state space of two 1S2 systems is denoted by 
<$2,2- The multiplicativity of the capacity (18) implies 
that £2,2 is equivalent to S4. However, we write £2,2 to 
emphasize the bipartite structure. 

In what follows, instead of using the standard rep- 
resentation for bipartite systems (3) we generalize the 
Bloch representation to two generalized bits. A state 
i>AB £ ^2,2 has Bloch representation "ipAB = [oj, P, C] 



9 



with 

a 1 = 2p(x i )-l 

& = 2p( yj )-l (24) 

C« = Ap(x i ,y j )-2p(x i )-2p(y j ) + l 

for i,j = l,...,d,2- Note that a = ipA and /3 = 
are the reduced states in the Bloch representation (13). 
The correlation matrix can also be written as C y = 
p(xi,yj) - p{xi,y~j) - p(Xi,yj) + p(x l ,y j ) 1 and character- 
izes the correlations between subsystems. Product states 
have Bloch representation 

{tpA®<PB) A = [<PA,<PB,<PA<Pb] > ( 25 ) 

with rank-one correlation matrix. In QT, where c?2 = 
3, two-qubit density matrices arc often represented by 
[a,/3,C] through formula (41). Definition (24) implies 

-1 <a\l3 j ,C ij < 1 . (26) 

The invertible map C[^ab\ = "<Pab defined by (24) 
also determines the Bloch representation of effects ft — 
O o C^ 1 . In particular, the tensor-product of two effects 
of the form (15) is 

(27) 

The map C also determines the action of reversible trans- 
formation in the Bloch representation. Since C is affinc 
but not linear, the action G = C o G o Cr l need not be 
linear. Identities (16, 25) and fi2 = imply that the 
maximally-mixed state in 52, 2 is ^2,2 = (A2j M2, A2A2) = 
0. This and (12) imply that transformations (^2,2 act on 
the generic vector [a,/3,C] as matrices. In particular, 
local transformations Ga,G b G Q2 act as 

{G A ®G B ) A [a,p,C\ = [G A a,G B /3,G A CG T B } . (28) 

Subsection IV E concludes that G2 consists of orthogonal 
matrices, and Lemma 8 shows that all transformations in 
C*2,2 are orthogonal, too. Orthogonal matrices preserve 
the norm of vectors, therefore all pure states ip G £2,2 
satisfy 

M 2 = H 2 + |/3| 2 +tr(C T C) = 3. (29) 

The constant in the right-hand side can be obtained by 
letting ijj = [a, a, aa T ] with \a\ = 1. 

G. Consistency in the subspaces of two generalized 

bits 

In this subsection we use a trick introduced in [19]: to 
impose the equivalence between a particular subspace of 
£2,2 and <S 2 (Requirement 3). 

Consider the unit vector v\ from (14) and the two 
distinguishable pure states ipo — v\ and <£>\ = —v\ 



from £2- The four pure states ip a .b — Pa ® fb G £2,2 
can be distinguished with the complete measurement 
Vt a fi = Vl Va ® Q, Vh where a, b <G {0,1}. Formula (27) 
implies 

f2 ,o[a,/3,C] = (l + a 1 +p 1 + C 1 ' 1 )/4, (30) 
tl ltl [a,p,C\ = (1 - a 1 - p l + C 1 ' 1 )^ . (31) 

Requirement 3 implies that the subspace 

S' 2 = {i> e S 2 , 2 ■ (n , + Oi,i)(V) = 1} , 

is equivalent to £2- By adding (30) plus (31), it becomes 
clear that a state ip = [a, (3, C] belongs to S 2 if and only 
if C 1 ' = 1. Moreover, if ip € S 2 , then it follows from 
ft 0l i(^) > and fii.oO/O > that a 1 = /3 1 . 

Theorem 5. The state space of a generalized bit has 
dimension three (d 2 = 3j. 

Proof. Recall that the case under consideration is odd d 2 
larger than one. The space S 2 C £2,2 is equivalent to £2, 
which is a ^-dimensional unit ball. If ^0,0 and c^i.i are 
considered the poles of this ball, then the equator is the 
set of states Veq such that fio,oC0eq) = (V"eq) = 1/2- 
Equations (30, 31) tell that equator states have a 1 = 
(3 1 = 0, and then 

(32) 

where a,/3,7,f e M^^ 1 and C £ R( d 2-i)x(d 2 -i)^ Con _ 
sider the action of Ga®^- for Ga G Q2 on an equator state 
ipeq- Since Q2 is transitive on the unit sphere, if 7 ^ 
then there is some Ga G G2 such that the correlation 
matrix transforms into 

which is in contradiction with (26). Therefore 7 = 0, and 
by a similar argument f = 0. 

The stabilizer of v\ is the largest subgroup H2 C £2 
which leaves £1 invariant (Subsection IV E). For any pair 
Ha,Hb G % the identity Q a ,b°(HA®H B ) = Q a ,b holds, 
which implies that if ip eq belongs to the equator (32) then 

(H A ®H B )^ cq r = [( tf° a ) ' ( Hb0 )' ( HaCHI ) 

also belongs to the equator. The equator is a unit ball of 
dimension <i 2 — 1. Since the set 

{(Ha 8> H B )(iP eq ) : H A ,H B £ H 2 } (33) 

is a subset of the equator, the dimension of its affine span 
is at most d® — 1. 

Consider the case C = 0. The normalization condition 
(29) implies |a| = |/3| = 1. The set {H A a : H A G H 2 }_has 
dimension c?2 — 1, and the same for {H B j3 : H B £ 0.2)- 
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Therefore the set (33) has dimension at least 2(d 2 — 1) 
generating a contradiction. 

Consider the case C ^ 0. The group action on C 
corresponds to the exterior tensor product H 2 ^ % = 
{H A _® H B ■ Ha, Hb G %%}. If d 2 > 3 and SO(d 2 - 1) 
C H 2 then 7^2 is irreducible in C d2_1 , and a simple 
character-based argument shows that H 2 H 2 is irre- 
ducible in (C^- 1 )® 2 (see page 427 in [30]). Hence the 
set 



{H A CHl:H A ,H B £U 2 } 



(34) 



has dimension (d 2 — l) 2 , which conflicts with the dimen- 
sionality requirements of (33). If d 2 = 7 and H 2 contains 
the representation of SU(3) given in (23), then the sub- 
group 



H = 



U 

u 



with U G SO(3) C SU(3) has two invariant C 3 subspaces. 
Therefore the invariant subspaces of 1-i 2 MH 2 have at least 
dimension 9, and independently of C, the set (34) has at 
least dimension 9, which conflicts with the dimensionality 
requirements of (33). So the only possibility is d 2 = 
3. □ 

From now on, only the case d 2 = 3 is considered. Sub- 
section IVE tells that SO(3) C Q 2 C 0(3), which implies 
that either Q 2 = 0(3) and U 2 = 0(2), or Q 2 = SO(3) 
and n 2 = SO(2). 

Let us see that the first case is impossible. The group 
H 2 = 0(2) is irreducible in C 2 , therefore 7l 2 M H 2 is ir- 
reducible in (C 2 )® 2 . Three paragraphs above it is shown 
that C 7^ 0, hence the set (34) has dimension (d 2 — l) 2 , 
which is a lower bound for the one of (33), which is larger 
than the allowed one (^2 — 1 = 2). 

Let us address the second case. The group H 2 = SO(2) 
is irreducible in R 2 but reducible in C 2 ; so the previous 
argument does not hold. The vector space of 2 x 2 real 
matrices decomposes into the subspace generated by ro- 
tations 



jj / cosf smv 
R + = 

V — sin v cos v 

and the one generated by reflections 

^ _ - cosv smv 
sin v — cos v 



(35) 



(36) 



where det R± = ±1. For any pair Ha, Hb G H 2 the ma- 
trix H A R+Hg is a rotation and the matrix HaR-H^ 
is a reflection; therefore the 2-dimensional subspaces 
(35) and (36) are invariant under j-L 2 E3 j-L 2 . Since the 
equator has dimension d 2 — 1 = 2, all matrices C / 
must be fully contained in one of the two subspaces 
spanned by (35) or (36), otherwise the dimension of the 
set (33) would be too large again. For the same reason 
a = j3 = 0. 



Depending on whether C is in the subspace generated 
by R + from (35) or by R- from (36), the states in the 
equator of S 2 are either ip^ q or ip~ q , where 



eq 




1 

cos v sin v 
=F sin v ± cos v 



The proportionality constants in C oc R± are fixed by 
normalization (29). It turns out that both the symmet- 
ric case and the antisymmetric case tp^ correspond 
to different representations of the same physical theory 
— that is, the corresponding state spaces (together with 
measurements and transformations) are equivalent in the 
sense of Subsection HE. To see this, define the linear 
map t : S 2 — > S 2 as f(ai, a 2 , a 3 ) T := {a\, a 2 , — a 3 ) T ; 
that is, a reflection in the Bloch ball. The equivalence 
transformation is defined as C := t <g> I (in quantum in- 
formation terms, this is a "partial transposition"). This 
map respects the tensor product structure, leaves the set 
of product states invariant, and satisfies £(^y = 
[19]. In other words: we have reduced the discussion 
of the antisymmetric theory to that of the symmetric 
theory [35], which will be considered for the rest of the 
paper. 

The orthogonality of the matrices in C/2,2 implies that 
S 2 is a 3-dimensional ball, and not just affinely related to 
it. Hence all states on the surface of the ball S 2 C 52,2 
can be parametrized in polar coordinates u G [0, 7r) and 
v G [0, 2ir) as 



i'{u,v) 



(37) 




1 

sin u cos v sin u sin v 
sin u sin v — sin u cos v 



These states cannot be written as proper mixtures of 
other states from S' 2 . It is easy to see that this implies 
that they are pure states in £2.2- 



H. The Hermitian representation 

In this subsection, a new (more familiar) representa- 
tion is introduced, where states in £2 are represented by 
2x2 Hermitian matrices. For any state ip G 6>2 in the 
standard representation (1), define the linear map 



a l - a 2 - a 3 



5>V. (38) 



The Pauli matrices 



1 

1 



-i 

1 



! G 



1 

-1 



together with the identity I constitute an orthogonal ba- 
sis for the real vector space of Hermitian matrices. In 
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terms of the Bloch representation, the map (38) has the 
familiar form 



will be useful. The Hcrmitian representation of a state 
tJjab = [a, /?, C] £ 5 2 ,2 is 




All positive unit-trace 2x2 Hermitian matrices can be 
written in this way with ip in the unit sphere. Since 
£2 is a 3-dimcnsional unit sphere, the set £[£2] is the 
set of quantum states. The extreme points of £[£2] are 
the rank-one projectors: each pure state ip £ S2 satisfies 
£[ip] = IV'XV'I) where the vector \ip) £ C 2 is defined up to 
a global phase. Effect (15) associated to the pure state 
(p e S2 is 

Sl v {i/,) = (il^oC- 1 ) (£[</-]) = tr (I^Xvl C[i/>]) . (39) 



£® 2 {iPab] 



(41) 



1 r 
4 



333 

«=1 3=1 *,J=1 



The action of local transformations Ga,Gb £ £7 2 on 
VUb e 5 2 , 2 is 

£ m [(G A ®G B ){ipAB)] = {Ua®U b )pab{Ua®U b ) ] (42) 

where pab = £® 2 {4>ab] and Ua,Ub £ SU(2) are related 
to Ga,Gb via (40). Now, we are ready to prove 

Theorem 6. The only GPT with rf 2 > 1 satisfying Re- 
quirements 1-5 is quantum theory. 



which shows that <S? C 



CH 
D 2,2- 



Note that the state 99 and its associated effect Q v are Proo j We start by reproducing an argument from [19] 
both represented by The action of a reversible 

transformation G £ Q2 — SO(3) in the Hermitian repre- 
sentation is 

£[G(V>)] = VC[il>]rf , 
where U £ SU(2) is related to G via 



A particular family of 
pure states in 6>2, 2 is ip{u) = tp(u,0) defined in (37). 
The Hermitian representation of i/j(u) is the projector 

C® 2 [i){u)} = \ip(u))(il>{u)\ onto the C 2 <g> C 2 -vcctor 



\tp(u)) = cos-|+) 



■sm-|-)®|-) 



3=1 



(40) 



and & l are the matrix components (equation VII. 5. 12 
in [26]). In summary, the generalized bit in all theories 
satisfying 0I2 > 1 and the requirements, is equivalent to 
the qubit in QT. 



I. Reconstructing quantum theory 

In this subsection, the main result of this work is 
proved. But before, let us introduce some notation. 

In QT, the state space with capacity c and the corre- 
sponding group of reversible transformations are 



S? = { P e C c) 
£ C Q = {U<E>U* 



: p > 0, trp = 1} , 
U £ SU(c)} . 



where |+) = (1, l) T /\/2 and |-) = (-l,l) T /\/2. 
From the Schmidt decomposition, it follows that 
all rank-one projectors in C 4x4 can be written as 
(Ua <£> Ub)\^>{u)){iIj{u)\{Ua ® UbY for some value of u 
and some local unitarics Ua, Ub € SU(2). Thus, all rank- 
one projectors are pure states in 5^2 • Their mixtures 

generate all of S®, therefore Q S^ 2 - 
Direct calculation shows that 

tr(C^m £®VD =\ + \ [« T « + W + tr(C T C')] 

(43) 

for any pair of states ifj = [a,j3,C] and ip' = [a',(3',C] 
from 5 2j 2- Lemma 8 shows that all G £ C/2,2 are orthog- 
onal matrices. Therefore, the Euclidean inner product 
between states, as in the right-hand side of (43), is pre- 
served by the action of any G £ £2,2 ■ Equality (43) 
maps this property to the Hermitian representation: any 
H £ Q22 preserves the Hilbcrt-Schmidt inner product 
between states: 



The joint state space of m generalized bits is denoted by 
iS 2 xm, and the corresponding group of reversible trans- 
formations by Q2xm. The Hermitian representation of a 
state ip £ 5 2 x m is defined to be £® m [ip], where £® m := 
£ <g> • • • <g> £, and £ is defined in (38). The map £® m acts 
independently on each tensor factor, hence it translates 
the tensor product structure from the standard represen- 
tation (4, 5, 7) to the Hcrmitian one. For example, if 
if £ S 2 is a pure state, then C® m {(p® m ] = \<p)(tp\® m ■ The 
notation 



tT[H(p)H(p')]=tT(pp') 



(44) 



£■ 



y 2 x 



1 [<S 2 X tr 

l °G 2 > 



for all p, p' £ S^2- 

For any pure state tp £ 5 2 , the rank-one projector 
\<p)(<p\ ® \<p}(f\ = £® 2 [<p <S> <p] is a pure state in S^ 2 : an< ^ 
tr(\ip)((p\ <g> \ip){ip\ p) = {% ® fl v ) o (£® 2 )- 1 (p) is a mea- 
surement on S^2- Any rank-one projector £ C 4x4 
is a pure state in 5^2, hence there is H £ Q^ 2 such 
that H(\p)(p\ <g) I^Xvl) = IV'XV'I- Composing the trans- 
formation H with the effect £l v <g> fl v generates the effect 
(fl v ® f2 y ) o (£® 2 ) -1 o H~ x , which maps any p £ to 



(£« 



tx[\ v M®\^MH-\p)] 



tr 



P\ 



(45) 
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where (44) has been used. In summary, every rank-one 
projector IV'XV'I G C 4x4 has an associated effect (45) 
which is an allowed measurement on S^ 2 ■, an d these gen- 
erate all quantum effects. 

We have seen that all quantum states are contained 
in Sf, but can there be other states? If so, the associ- 
ated Hermitian matrices should have a negative eigen- 
value (note that all states in the Hermitian representa- 
tion (41) have unit trace). If p has a negative eigenvalue 
and is the corresponding eigenvector, then the associ- 
ated measurement outcome (45) has negative probability. 
Hence, we conclude that = S^, and similarly for the 
measurements. 

All reversible transformations H G map pure states 
to pure states, that is, rank-one projectors to rank-one 
projectors. According to Wigner's Theorem [31], ev- 
ery map of this kind can be written as H(\ip)(tp\) = 
(U\ijj))(U\ip)) t, where U is either unitary or anti-unitary. 
If U is anti-unitary, it follows from Wigner's normal 
form [32] that there is a two-dimensional U -invariant sub- 
space spanned by two orthonormal vectors \9o)> € C 4 
such that U(t \6 ) +ti|0i}) equals either t o \0 o ) +ii|0i) 
or iie ls \8o) + ?oe~ ls |#i) for some s <E R. In both cases, 
U acts as a reflection in the corresponding Bloch ball, 
which contradicts Requirement 3 because we know that 
g 2 = SO(3). Therefore Qf C Q?. 

We know that Q^ 2 contains all local unitaries. Since 
this group is transitive on the pure states, it contains at 
least one unitary which maps a product state to an entan- 
gled state. It is well-known [33] that this implies that the 
corresponding group of unitaries constitutes a universal 
gate set for quantum computation; that is, it generates 
every unitary operation on 2 qubits. This proves that 

gf = gf. 

Consider m generalized bits as a composite system. 
From the previously discussed case of £2,2, we know 
that every unitary operation on every pair of general- 
ized bits is an allowed transformation on S^ x m . But two- 
qubit unitaries generate all unitary transformations [33] , 
hence Q^m C Q¥ xm . By applying all these unitaries to 
\(p}(ip\® m , all pure quantum states are generated, hence 
S® m C 5^ m . Reasoning as in the £2.2 case, for ev- 
ery rank-one projector IV'XV'I acting on C 2 , the asso- 
ciated effect which maps p € S¥ xm to tr(\ip)(ip\p) is an al- 
lowed measurement outcome on S^ x m . This implies that 
all matrices in Sl\ m have positive eigenvalues, therefore 
5 2 H xm = 5 2 Q m and Qf K m = £ 2 Q m . 

The remaining cases of capacities c that are not powers 
of two are treated by applying Requirement 3, using that 
S c C S2™ for large enough m. □ 

V. CONCLUSION 

We have imposed five physical requirements on the 
framework of generalized probabilistic theories. These re- 



quirements are simple and have a clear physical meaning 
in terms of basic operational procedures. It is shown that 
the only theories compatible with them are CPT and QT. 
If Requirement 4 is strengthened by imposing the conti- 
nuity of reversible transformations, then the only theory 
that survives is QT. Any other theory violates at least 
one of the requirements, hence the relaxation of each one 
constitutes a different way to go beyond QT. 

The standard formulation of QT includes two postu- 
lates which do not follow from our requirements: (i) the 
update rule for the state after a measurement, and (ii) the 
Schrodingcr equation. If desired, these can be incorpo- 
rated in our derivation of QT by imposing the following 
two extra requirements: (i) if a system is measured twice 
"in rapid succession" with the same measurement, the 
same outcome is obtained both times [4], and (ii) closed 
systems evolve reversibly and continuously in time. 

This derivation of QT contains two steps which de- 
serve a special mention. First, a direct consequence of 
Requirement 3 is that 52 is fully surrounded by pure 
states, which together with Requirement 4 implies that 
£2 is a ball. Second, this ball has dimension three, since 
d = 3 is the only value for which SO(d — 1) is reducible 
in C d . 

Modifications and generalizations of QT are of interest 
in themselves, and could be essential in order to construct 
a QT of gravity. Some well-known attempts [15, 16] have 
shown that straightforward modifications of QT's mathe- 
matical formalism quickly lead to inconsistencies, such as 
superluminal signaling [17]. This work provides an alter- 
native way to proceed. We have shown that the Hilbcrt 
space formalism of QT follows from five simple physical 
requirements. This gives five different consistent ways 
to go beyond QT, each obtained by relaxing one of our 
requirements. 
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Appendix A: Lemmas 

Lemma 1. In any state space S c , the only state ip G S c 
which is invariant under all reversible transformations 

G(ip) = ip for all G e Gc , (Al) 

is the maximally-mixed state [i c , defined in (II). 

Proof. Suppose rb g S c satisfies (Al). Any state can 
be written as a mixture of pure states: ip = J^klkipk- 
Normalization J„ dG = 1, condition (Al), the linearity of 
G, the purity of all iph, the definition of /j, c , and ^2 k qu = 
1, imply 

ip = [ ipdG= [ G{yj)dG 

= y^Qk I G{ip k ) dG = ^2qk fJ-c = Mc , 
k Jg ° k 
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which proves the claim. 



□ 



Lemma 2. If Q is a compact real matrix group, then 
there is a real matrix S > such that for each G £ Q the 
matrix SGS~ X is orthogonal. 

Proof. Since the group Q is compact, there is an invariant 
Haar measure [26], which allows us to define 



G T GdG . 



P 



Since each G is invertible, the matrix G T G is strictly 
positive, and P too. Define S = \f~P > where both 
S, S^ 1 are real and symmetric. For any G £ Q we have 
{SGS~ x y (SGS- 1 ) = I, which implies orthogonality. □ 



To prove the second part, let ip[ j ■ • ■ i ^4 be the states 
that are distinguished by the measurement, that is 
^a(V'b) = fia.b- Every ip' b can be written as a convex 
combination of pure states ip'b = J2k 1 k V'fe.fc- But effects 
are linear functions such that < Q(V0 < 1 for any state 
ip. Hence &(ip' b ) — is only possible if fl(ipb,k) = for all 
k, and similarly for the case £l(ip' b ) — 1. It follows that 

^a(i>b,l) = 0~a,b- □ 

Lemma 5. If S c is a state space with capacity c > 1 and 
jX c the corresponding maximally-mixed state, then there 
are c pure distinguishable states ipi , . . . , ip c £ <S C such that 



He 



1 c 

r — ^ 



a=l 



Lemma 3. If ha and hb are the maximally-mixed states 
of the state spaces Sa andSs, then the maximally-mixed 
state of the composite system Sab * s 



Hab = HA® HB 



Proof. The pure states ip A in Sa linearly span M d - 4+1 , and 
the pure states ip B in Sb linearly span R dfs+1 . Therefore, 

)R dB+1 . In 
of Sab can 



pure product states ip A ® ip B span 



particular, the maximally-mixed state (11 
be written as 



HAB 



a, 6 



(A2) 



where t a ,b £ R are not necessarily positive coefficients, 
and all tpa^b are P ure - From definition (1), the first 
component of the vector equality (A2) implies b t a j, = 
1. The maximally- mixed state is invariant under all re- 
versible transformations, in particular the local ones 



HAB 



dG a dG B {G A ®G B ){HAB) 

''A JQb 

Y^ta.h fdG A G A (^) ® [dG B GB(ip, 



= E*° 



b HA® HB = HA® HB 



where the same tricks from Lemma 1 have been used. □ 

Lemma 4. For every tight effect Q, there is a pure state 
tjj such that Q(ijj) = 1. Also, if a measurement Qi, ... ,fl n 
distinguishes n states tpi, . . . , tp n , then these states can be 
chosen pure. 

Proof. By definition, for each tight effect il there is a (not 
necessarily pure) state ip' such that = 1. Every i/j' 

can be written as a mixture of pure states tpk, that is 
V>' = SfeftV'fc wit h qk > and X)fc% = !• Effects 
are linear functions such that f2(V0 < 1 for any state 
Therefore, it must happen that all pure states ipk in the 
above decomposition satisfy £l(ipk) = 1- 



Proof. Since Si contains a single state the claim is triv- 
ially true for c = 1. Since 52 is the c^-dimensional unit 
ball, two antipodal points (p\ and (p 2 = — (p\ are pure, 
distinguishable and satisfy 



1, 

fj-2 = 



(A3) 



Now, consider the joint state space of n generalized 
bits, denoted iS 2 x ™- Lemma 3 and (A3) imply that the 
maximally-mixed state of iS 2 x ™ is 



M(n) 



= - E 

On / j 



® 'far, 



(A4) 



a;G{l,2} 



The states (p ai ® ■ ■ ■ ® ip an £ S 2 x™ for all oi, . . . , a n £ 
{1,2} arc perfectly distinguishable by the corresponding 
product measurement, hence the capacity of iS 2 xn, de- 
noted c n . satisfies 



> 2" . 



(A5) 



Let (fii, . . . , fi c „) be a complete measurement which dis- 
tinguishes the states ipi, . . . ,i\) Cn £ 5 2 xn. According to 
Lemma 4 these states can be chosen to be pure. Since 
Sfc=i ^k(H(n)) = 1) there is at least one value of k, de- 
noted ko, such that 



^k {H(n)) < 1/Cn 



(A6) 



The state ipi £ S 2 from (A3) is pure, hence ipf n e5 2 » 
is pure too. Requirement 4 tells that there is a reversible 
transformation G acting on such that G{tpk Q ) = 

ipf n . The measurement (fii oG" 1 , . . . , Q Cn oG~ 1 ) distin- 
guishes the states G(ipi), . . . , G(ip Cn ). Inequality (A6), 
the invariance of H(n) under G, expansion (A4), the pos- 
itivity of probabilities, and (f2fc o G^ 1 )(^f ") = 1, imply 



1 



- > (Sli.oG- 1 )^)) 

= ^E (O^oG- 1 )^ 

ai£{l,2} 
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This and (A5) imply c„ = 2". This together with (A4) 
shows the assertion of the lemma for state spaces whose 
capacity is a power of two. The rest of cases are shown 
by induction. 

Let us prove that if the claim of the lemma holds for 
a state space with capacity c, with c > 1, then it holds 
for a state space with capacity c — 1 too. The induc- 
tion hypothesis tells that there is a complete measure- 
ment (f2i, . . . , f2 c ) which distinguishes the pure states 
ipi,...,ip c G S c , and [i c = \ J2t=i € S c is the cor- 
responding maximally-mixed state. Requirement 3 tells 
that the state space <S c _i is equivalent to 



5' 



{V>e<S c |Q 1 (V0 + --- + o c _ 1 (V0 = 1} 



According to Requirement 3, for each G G Q c -i there is 
G' G Gc which implements G on S' c _ 1 . Hence G'(ipk) G 
S' c _ 1 for k = 1, . . . , c— 1, which implies (0 C o G')(ipk) = 
for those k, and 



1 



= -V(J! c o G')^ k ) = i(n c o g'XVc) , 

and (f2 c o G')(ip c ) = 1. Requirement 3 tells that the set 

S[ = {0> G 5 C | Q C (V>) = 1} 

is equivalent to <Si, which contains a single state. This 
and Q c (ipc) = 1 imply that G'(ip c ) = "0c and then 
G'{ii' c _i) = fj/ c _i, where we define 



c-l 



k=l 



r 



c- 1 



0c es;_ 



For any G G 
G'(aCi) = /4 



Gc-i the corresponding G" satisfies 
. Due to Lemma 1 the invariant state 
fi' c _i must be the maximally- mixed state in S' c _ 1 , which 
has the claimed form, and Requirement 3 extends this to 

Sc-i. D 

Lemma 6. Let S be a state space such that the subset of 
pure states V is a connected topological manifold. If the 
corresponding group of transformations Q is transitive on 
V , then the largest connected subgroup C C Q is transitive 
on V , too. 

Proof. This proof involves basic notions of point set 
topology. 

Since Q is compact, it is the union of a finite number 
of (disjoint) connected components Q = C\ U • • ■ U C n . 
If n = 1 the lemma is trivial. Let C be the connected 
component d containing the identity matrix I, which is 
the largest connected subgroup of Q. Each connected 
component Ci is clopen (open and closed), compact and 
a coset of the group: d = G; o C for some Gi G £7 [30]. 

Pick ip G V, and consider the continuous surjective 
map / : Q -> V, defined by f{G) = G{ip) G V. Since 



C is compact f(C) C V is compact too. Since the mani- 
fold V is in particular a Hausdorff space, f(C) is closed. 
Consider the set V = f~ 1 {f(C)). If two group elements 
G, H G G are in the same component, that is G _1 i? G C, 
then G G T> implies H G T>, using that C is a normal 
subgroup of Q . This implies that T> is the union of some 
connected components Ci, and so is Q\D. In particular, 
Q\D is compact, thus f(Q\T>) is compact, hence closed. 
Therefore, f(C) = V\f(G\V) is open. We have thus 
proven that f(C) ^ is clopen. Since V is connected, it 
follows that f (C) = V. □ 

The following lemma shows that there are transforma- 
tions for two generalized bits which perform the "classi- 
cal" swap and the "classical" controlled-not in a partic- 
ular basis. Note that these transformations do not nec- 
essarily swap other states that are not in the given basis, 
as in QT. However, they implement a minimal amount of 
reversible computational power which exceeds, for exam- 
ple, that of boxworld, where no controlled-not operation 
is possible [14]. 

Lemma 7. For each pair of distinguishable states 
<Po,cpi G £>2, there are transformations G swap ,G cno t G 
02,2 such that 

Gswap {}Pa ® (fib) = Lpb<E)(fia , (A7) 
G cno t(<p a <£> fb) = fa®fa®b , (A8) 

for all a, b G {0, 1}, where © is addition modulo 2. 

Proof. Let (f2o,f2i) be the measurement which distin- 
guishes ((p ,ip{), that is £l a (<Pb) = <5 a ,6- Define ip a ^ = 
fa <£> fb and Q, a ,b = r2 a (g) Oj, for a, b G {0, 1}. Define 



S' 3 = {0gS 2 , 2 
S' 2 = {vie5 2 , 2 



(Oo,i + Oi, + Oi,i)(V)) = 1}, 
(Oo,i + ni,o)(V0 = 1} , 



and note that S' 9 C Si C <So 



At the end of 



Subsection IV B, it is shown that two distinguishable 
states ifotfi G <S 2 have Bloch representation satisfy- 
ing ip = —ipi- According to Requirement 4 there 
is G G Q2 such that G(tpo) = tpi, and by linearity, 
G((fi) = —G(tpo) = —ipi = ipQ. Requirement 3 im- 
plies that there is a transformation Gg Wap for S' 3 such 



that GL,Wo,i) 



ipi,0 and G£ wap (0i iO ) = Vo,i- Ac- 
cording to Lemma 5, the maximally- mixed state in S' 3 
can be written as /i' 3 = (0o,i + V'l.o + V'i,i)/3- Equalities 
G^wap (M3) = and G^ wap (0 o ,i + ^1,0) = 0o,i + Vu,o im- 
ply that Gg W ap (V'i,i) = 0i, i- Using Requirement 3 again, 



swap 

there is a reversible transformation G swa p G Q2, 2 which 
implements Gg Wap in the subspacc 53 . Repeating the ar- 
gument with the maximally- mixed state (now in S2.2) we 
conclude that G swa p('0i 1) = 01 i> hence G swa p satisfies 
(A7). 

The existence of G cno t is shown similarly, by exchang- 
ing the roles of ^o.i and 0i,i. □ 
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Lemma 8. Reversible transformations for two general- 
ized bits in the Block representation (24) are orthogonal: 

G2,2 C 0(d 4 ) . 

Proof. In Subsection IV F the Bloch representation for 
two generalized bits is defined, and it is argued that re- 
versible transformations Q2.2 act on [a, f3, G] as matrices. 
In particular, local transformations (28) are 

( G A \ 

(G A ®G B ) A = G B , (A9) 

\ G A ® G B J 

where each diagonal block acts on an entry of [a,j3,C], 
and GatGb € Qi- In Subsection IV E it is argued that 
Q2 Q 0(c?2), hence local transformations (A9) are orthog- 
onal. 

Lemma 2 shows the existence of a real matrix S > 
such that for any G £ G2,2 the matrix SGS' 1 is orthog- 
onal. In particular 

[S ■ (G A <8> G S ) A • S- 1 ] T [S ■ (G A <8> G B ) A ■ S- 1 ] = I , 

which implies the commutation relation 

S- {G A ®G B ) h = (G a ®GbY -S (A10) 

Subsection IV E concludes that d 2 is odd, and that 
SO(d 2 ) C £ 2 except when d 2 = 7, where MG 2 M^ C Q 2 
and G 2 is the fundamental representation of the smallest 
exceptional Lie group [30]. For d 2 > 3 these groups act 
irreducibly in C d2 , hence Q 2 acts irreducibly in C d2 too 
[30]. The first two diagonal blocks in (A9) arc irreducible. 
The exterior tensor product of two irreducible representa- 
tions (in C d ) is also an irreducible representation, hence 
the third diagonal block in (A9) is also irreducible. This 
together with (A10) implies that 

/ al \ 
S = 61 

y si J 

for some a, b, s > (Schur's Lemma [30]). 

According to Lemma 7, for each unit vector a € 1S2 
there is a transformation G SW ap € £2,2 such that 

G SW ap [a, 0,0] 



= G swap ([a, a, aa T ] + [a, -a, -«a T ]) /2 
= ([a,a,aa T ] + [-a,a,-aa T ]) /2 
= [0,a,0] . 

Since S'GgwapS' -1 is orthogonal, the vectors [a, 0,0] and 
[0, ba~ 1 a, 0] have the same modulus, hence a = b. Also, 
there is a transformation G cno t £ ^2,2 such that 

G cn ot [0,0, aa T ] 
= G cno t ([a, a, aa T ] + [—a, —a, aa T ]) /2 
= ([a,a,aa T ] + [-a,a,-aa T ]) /2 
= [0,a,0] . 

Since SGcnotS 1 1 is orthogonal, the vectors [0, 0, aa T ] and 
[0,fes _1 a, 0] have the same modulus, hence s = b. Con- 
sequently S = al, and the claim follows. □ 



Lemma 9. The results of this paper also hold if Require- 
ment 5 is replaced by Requirement 5'. 



Proof. Assume Requirement 5', but not Requirement 5. 
It follows that Si contains a single state only: if it 
contained more than one state, there would exist some 
ip G Si which is not completely mixed, which would then 
be distinguishable from some other state, contradicting 
that Si has capacity 1. Requirement 5 is used in the proof 
of Theorem 1. This proof is easily modified to comply 
with Requirement 5' instead: adopting the notation from 
the proof, the state <£> m ; x is not completely mixed, thus 
distinguishable from some other state. This proves ex- 
istence of some Cl with the claimed properties, and the 
other arguments remain unchanged, proving that 6> 2 can 
be represented as a unit ball. All pure states in £2 are 
not completely mixed, hence have a corresponding tight 
effect which is physically allowed. But for every state on 
the surface of the ball, there exists only one unique tight 
effect which gives probability one for that state. Hence, 
all these effects must be allowed, and since they generate 
the set of all effects, this proves Requirement 5 for use 
in (14) and the rest of the paper. □ 



